MODS: Multiple One-class Data Streams Learning from Homogeneous Data
نویسندگان
چکیده
This paper presents a novel approach, called MODS, to build an accurate time evolving classifier from multiple one-class data streams learning time evolving classifier. Our proposed MODS approach works in two steps. In the first step, we first construct local one-class classifiers on the labeled positive examples from each sub-data stream respectively. We then collect the informative examples (support vectors) around each local one-class classifier, which can support the decision boundary of the classifier. This is called support vector preservation principle. In the second step, we construct a global one-class classifier on the collected informative examples. By using the support vector preservation principle, our proposed MODS explicitly addresses the problem of building accurate classifier from multiple one-class data streams. Extensive experiments on real life data streams have demonstrated that our MODS approach can achieve high performance and efficiency for the multiple one-class data streams learning in comparison with other approaches.
منابع مشابه
One-Class Learning and Concept Summarization for Vaguely Labeled Data Streams
In this paper, we formulate a new research problem of concept learning and summarization for one-class data streams. The main objective is to (1) allow users to label instance groups, instead of single instances, as positive samples for learning, and (2) summarize concepts labeled by users over the whole stream. The employment of the batch-labeling raises serious issues for stream-oriented conc...
متن کاملCost Sensitive Online Multiple Kernel Classification
Learning from data streams has been an important open research problem in the era of big data analytics. This paper investigates supervised machine learning techniques for mining data streams with application to online anomaly detection. Unlike conventional machine learning tasks, machine learning from data streams for online anomaly detection has several challenges: (i) data arriving sequentia...
متن کاملScoring System for Multiple Organ Dysfunction in Adult Horses with Acute Surgical Gastrointestinal Disease
BACKGROUND The prevalence of multiple organ dysfunction syndrome (MODS) in horses with acute surgical gastrointestinal (GI) disease is unknown. Currently, there are no validated criteria to confirm MODS in adult horses. OBJECTIVES To develop criteria for a MODS score for horses with acute surgical colic (MODS SGI) and evaluate the association with 6-month survival. To compare the MODS SGI sco...
متن کاملAn adaptive ensemble classifier for mining concept drifting data streams
Traditional data mining techniques cannot be directly applied to the real-time data streaming environment. Existing mining classifiers therefore need to be updated frequently to adopt the changes in data streams. In this paper, we address this issue and propose an adaptive ensemble approach for classification and novel class detection in concept-drifting data streams. The proposed approach uses...
متن کاملLearning to Classify Data Streams with Imbalanced Class Distributions
Streaming data is pervasive in a multitude of data mining applications. One fundamental problem in the task of mining streaming data is distributional drift over time. Streams may also exhibit high and varying degrees of class imbalance, which can further complicate the task. In scenarios like these, class imbalance is particularly difficult to overcome and has not been as thoroughly studied. I...
متن کامل